WO 97/48819 



-73- 



PCTYUS97/10376 



' SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(I) APPLICANT: 

(A) NAME: THE SCRIPPS RESEARCH INSTITUTE 

(B) STREET: 10550 North Torrey Pines Road 

(C) CITY: La Jolla 

(D) STATE:. California 

(E) COUNTRY: US 

(F) ZIP: 92037 

(G) TELEPHONE: (619) 784-2937 

(H) TELEFAX: (619) 784-9399 

(ii) TITLE OF INVENTION: CASSAVA VEIN MOSAIC VIRUS PROMOTERS AND 
USES THEREOF 

(iii) NUMBER OF SEQUENCES: 36 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(v) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/US 97/ 

(B) FILING DATE: 20-JUN-1997 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/020,129 

(B) FILING DATE: 20-JUN-1996 

(2) INFORMATION FOR SEQ ID NO:l: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 392 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



10 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
IrilS AGCTCAGCAA GAAGCAGATC AATATGCGGC ACATATGCAA CCTATGTTCA AAAATGAAGA 6 0 

i y 

t i 

jy ATGTACAGAT ACAAGATCCT ATACTGCCAG AATACGAAGA AGAATACGTA GAAATTGAAA 120 

«3 AAGAAGAACC AGGCGAAGAA AAGAATCTTG AAGACGTAAG CACTGACGAC AACAATGAAA 18 0 

□ 20 

N 5 AGAAGAAGAT AAGGTCGGTG ATTGTGAAAG AGACATAGAG GACACATGTA AGGTGGAAAA 240 

O TGTAAGGGCG GAAAGTAACC TTATCACAAA GGAATCTTAT CCCCCACTAC TTATCCTTTT 300 

25 ATATTTTTCC GTGTCATTTT TGCCCTTGAG TTTTC CTATA TAAGGAACCA AGTTCGGCAT 360 
TTG TGAAAAC AAGAAAAAAT TTGGTGTAAG CT 392 
(2) INFORMATION FOR SEQ ID NO : 2 : 

30 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 524 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
3 5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
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(iv) ANTI- SENSE: NO 
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5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

GGTAC C AGAA GGTAATTATC CAAGATGTAG CATCAAGAAT CCAATGTTTA CGGGAAAAAC 60 
TATGGAAGTA TTATGTGAGC TCAGCAAGAA G C AG ATCAAT ATGCGGCACA TATGCAACCT 12 0 

10 

ATGTTCAAAA ATGAAGAATG TACAGATACA AGATCCTATA CTGCCAGAAT ACGAAGAAGA 18 0 

ATACGTAGAA ATTGAAAAAG AAGAACCAGG CGAAGAAAAG AATCTTGAAG ACGTAAGCAC 24 0 

o 

fy 15 TGACGACAAC AATGAAAAGA AGAAGATAAG GTCGGTGATT GTGAAAGAGA CATAGAGGAC 3 00 

fS ACATGTAAGG TGGAAAATGT AAGGGCGGAA AGTAACCTTA TCACAAAGGA ATCTTATCCC 360 

OJ 

CCACTACTTA TCCTTTTATA TTTTTCCGTG TCATTTTTGC CCTTGAGTTT TCCTATATAA 42 0 

□ 20 

L.; GGAACCAAGT TCGGCATTTG TGAAAACAAG AAAAAATTTG GTGTAAGCTA TTTTCTTTGA 4 80 

'£ ~<t 

D AGTACTGAGG ATACAAGTTC AGAGAAATTT GTAAGTTTGA ATTC 524 

2 5 (2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 526 base pairs 

(B) TYPE: nucleic acid 
30 (C) STRANDEDNESS : single 

{ D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 
35 (iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

TCTAGACCAG AAGGTAATTA TCCAAGATGT AGCATCAAGA ATCCAATGTT TACGGGAAAA 60 

5 ACTATGGAAG TATTATGTGA GCTCAGCAAG AAGCAGATCA ATATGCGGCA CATATGCAAC 12 0 

CTATGTTCAA AAATGAAGAA TGTACAGATA CAAGATCCTA TACTGCCAGA ATACGAAGAA 180 

GAATACGTAG AAATTGAAAA AGAAGAACCA GGCGAAGAAA AGAATCTTGA AGACGTAAGC 24 0 

10 

ACTGACGACA ACAATGAAAA GAAGAAGATA AGGTCGGTGA TTGTGAAAGA GACATAGAGG 300 

ACACATGTAA GGTGGAAAAT GTAAGGGCGG AAAGTAAC CT TATCACAAAG GAATCTTATC 360 

Hjl5 CCCCACTACT TATCCTTTTA TATTTTTCCG TGTCATTTTT GCCCTTGAGT TTTCCTATAT 420 

G 

AAGGAACCAA GTTCGGCATT TGTGAAAACA AGAAAAAATT TGGTGTAAGC TATTTTCTTT 480 

i 1 1 

GAAGTACTGA GGATACAAGT TCAGAGAAAT TTGTAAGTTT GAATTC 526 

O 20 

Jf" (2) INFORMATION FOR SEQ ID NO : 4 : 

Q (i) SEQUENCE CHARACTERISTICS: 

^ (A) LENGTH: 411 base pairs 

25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 



30 



(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 



(iv) ANTI- SENSE: NO 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 



GGATCCTATG TTCAAAAATG AAGAATGTAC AGATACAAGA TCCTATACTG CCAGAATACG 6 0 
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AAGAAGAATA CGTAGAAATT GAAAAAGAAG AACCAGGCGA AGAAAAGAAT CTTGAAGACG 12 0 

TAAGCACTGA CGACAACAAT GAAAAGAAGA AGATAAGGTC GGTGATTGTG AAAGAGACAT 180 

5 AGAGGACACA TGTAAGGTGG AAAATGTAAG GGCGGAAAGT AACCTTATCA CAAAGGAATC 240 

TTATCCCCCA CTACTTATCC TTTTATATTT TTCCGTGTCA TTTTTGCCCT TGAGTTTTCC 300 

TATATAAGGA ACCAAGTTCG GCATTTGTGA AAACAAGAAA AAATTTGGTG TAAGCTATTT 36 0 

10 

TCTTTGAAGT ACTGAGGATA CAAGTTCAGA GAAATTTGTA AGTTTGAATT C 411 
(2) INFORMATION FOR SEQ ID NO : 5 : 

jl5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 05 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



B 20 



(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 



25 (iv) ANTI- SENSE : NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

30 

GGATCCTGAA GACGTAAGCA CTGACGACAA CAATGAAAAG AAGAAGATAA GGTCGGTGAT 6 0 



TGTGAAAGAG ACATAGAGGA CACATGTAAG GTGGAAAATG TAAGGGCGGA AAGTAACCTT 12 0 



3 5 ATCACAAAGG AATCTTATCC CCCACTACTT ATCCTTTTAT ATTTTTCCGT GTCATTTTTG 180 



CCCTTGAGTT TTCCTATATA AGGAACCAAG TTCGGCATTT GTGAAAACAA GAAAAAATTT 240 



GGTGTAAGCT ATTTTCTTTG AAGTACTGAG GATACAAGTT CAGAGAAATT TGTAAGTTTG 300 
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AATTC 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 261 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 



305 



10 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

q 

HI 15 (iv) ANTI-SENSE: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 



□ 20 

H' GGATCCGGTC GGTGATTGTG AAAGAGACAT AGAGGACACA TGTAAGGTGG AAAATGTAAG 6 0 



LJ 



261 



GGCGGAAAGT AACCTTATCA CAAAGGAATC TTATCCCCCA CTACTTATCC TTTTATATTT 120 

25 TTCCGTGTCA TTTTTGCCCT TGAGTTTTCC TATATAAGGA ACCAAGTTCG GCATTTGTGA 180 

AAACAAGAAA AAATTTGGTG TAAGCTATTT TCTTTGAAGT ACTGAGGATA CAAGTTCAGA 24 0 
GAAATTTGTA AGTTTGAATT C 

30 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 193 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(iii) HYPOTHETICAL: NO 



(iv) ANT I - SENSE : NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 



GGATCCTTAT CACAAAGGAA TCTTATCCCC CACTACTTAT CCTTTTATAT TTTTCCGTGT 



60 



CATTTTTGCC CTTGAGTTTT CCTATATAAG GAACCAAGTT CGGCATTTGT GAAAACAAGA 



120 



AAAAATTTGG TGTAAGCTAT TTTCTTTGAA GTACTGAGGA TACAAGTTCA GAGAAATTTG 



180 



TAAGTTTGAA TTC 



193 



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 143 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
GGATC CGTGT CATTTTTGCC CTTGAGTTTT CCTATATAAG GAACCAAGTT CGGCATTTGT 6 0 

GAAAACAAGA AAAAATTTGG TGTAAGCTAT TTTCTTTGAA GTACTGAGGA TACAAGTTCA 120 
GAGAAATTTG TAAGTTTGAA TTC 143 
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(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
<iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

TCTAGACCAG AAGGTAATTA TCCAAGATGT AGCATCAAGA ATC CAATGTT TACGGGAAAA 60 

ACTATGGAAG TATTATGTGA GCTCAGCAAG AAGCAGATCA ATATGCGGCA CATATGGATC 120 

CTGAAGACGT AAGCACTGAC GACAACAATG AAAAGAAGAA GATAAGGTCG GTGATTGTGA 180 

AAGAGACATA GAGGACACAT GTAAGGTGGA AAATGTAAGG GCGGAAAGTA ACCTTATCAC 24 0 

AAAGGAATCT TATCCCCCAC TACTTATCCT TTTATATTTT TCCGTGTCAT TTTTGCCCTT 3 00 

GAGTTTTCCT ATATAAGGAA CCAAGTTCGG CATTTGTGAA AACAAGAAAA AATTTGGTGT 36 0 

AAGCTATTTT CTTTGAAGTA CTGAGGATAC AAGTTCAGAG AAATTTG TAA GTTTGAATTC 42 0 

(2) INFORMATION FOR SEQ ID NO: 10: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 82 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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( D ) TOPOLOGY : 1 ineair 
(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

TCTAGACCAG AAGGTAATTA TCCAAGATGT AGCATCAAGA ATCCAATGTT TACGGGAAAA 6 0 

15 ACTATGGAAG TATTATGTGA GCTCAGCAAG AAGCAGATCA ATATGCGGCA CATATGCAAC 12 0 

CTATGTTCAA AAATGAAGAA TGTACAGATA CAAGATCCTA TACTGCCAGA ATACGAAGAA 18 0 

GAATACGTAG AAATTGAAAA AGAAGAACCA GGCGAAGAAA AGGATCCGGT CGGTGATTGT 24 0 

20 

GAAAGAGACA TAGAGGACAC ATGTAAGGTG GAAAATGTAA GGGCGGAAAG TAACCTTATC 3 00 

ACAAAGGAAT CTTATCCCCC ACTACTTAT C CTTTTATATT TTTCCGTGTC ATTTTTGCCC 360 

25 TTGAGTTTTC CTATATAAGG AACCAAGTTC GGCATTTGTG AAAACAAGAA AAAATTTGGT 420 

GTAAGCTATT TTCTTTGAAG TACTGAGGAT ACAAGTTCAG AGAAATTTGT AAGTTTGAAT 480 

TC 4 82 

30 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 58 base pairs 
3 5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
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(iii) HYPOTHETICAL: NO 



(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:ll: 

TCTAGACCAG AAGGTAATTA TCCAAGATGT AGCATCAAGA ATCCAATGTT TACGGGAAAA 60 

10 

ACTATGGAAG TATTATGTGA GCT CAGCAAG AAGCAGATCA ATATGCGGCA CATATGCAAC 12 0 

_ CTATGTTCAA AAATGAAGAA TGTACAGATA CAAGATCCTA T ACTGC C AG A ATACGAAGAA 18 0 

WlS GAATACGTAG AAATTGAAAA AGAAGAACCA GGCGAAGAAA AGAATCTTGA AGACGTAAGC 24 0 

i y 

CD ACTGACGACA ACAATGAAAA GAAGAGGATC CTTATCACAA AGGAATCTTA TCCCCCACTA 300 

H . t 

g CTTATCCTTT TATATTTTTC CGTGTCATTT TTGCCCTTGA GTTTTCCTAT ATAAGGAACC 360 

O 20 

n\ AAGTTCGGCA TTTGTGAAAA CAAGAAAAAA TTTGGTGTAA GCTATTTTCT TTGAAGTACT 42 0 

JSf GAGGATACAA GTTCAGAGAA ATTTGTAAGT TTGAATTC 458 

25 (2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 468 base pairs 

(B) TYPE: nucleic acid 

3 0 (C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 

35 (iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 

TCTAGACCAG AAGGTAATTA TCCAAGATGT AGCATCAAGA ATCCAATGTT TACGGGAAAA 60 

5 ACTATGGAAG TATTATGTGA GCTCAGCAAG AAGCAGATCA ATATGCGGCA CATATGCAAC 12 0 

CTATGTTCAA AAATGAAGAA TGTACAGATA CAAGATCCTA TACTGCCAGA ATACGAAGAA 18 0 

GAATACGTAG AAATTGAAAA AGAAGAACCA GGCGAAGAAA AGAATCTTGA AGACGTAAGC 24 0 

10 

ACTGACGACA ACAATGAAAA GAAGAAGATA AGGTCGGATC CTTATCACAA AGGAATCTTA 3 00 

TCCCCCACTA CTTATCCTTT TATATTTTTC CGTGTCATTT TTGCCCTTGA GTTTTCCTAT 36 0 

PLJ15 ATAAGGAACC AAGTTCGGCA TTTGTGAAAA CAAGAAAAAA TTTGGTGTAA GCTATTTTCT 42 0 

fjj TTGAAGTACT GAGGATACAA GTTCAGAGAA ATTTGTAAGT TTGAATTC 46 8 

^ (2) INFORMATION FOR SEQ ID NO: 13: 

Q20 

(i) SEQUENCE CHARACTERISTICS: 

I U 

(A) LENGTH: 491 base pairs 
y (B) TYPE: nucleic acid 

^ (C) STRANDEDNESS : single 

25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

30 

(iv) ANTI- SENSE: NO 



35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

TCTAGACCAG AAGGTAATTA TCCAAGATGT AGCATCAAGA ATCCAATGTT TACGGGAAAA 6 0 

ACTATGGAAG TATTATGTGA GCTCAGCAAG AAGCAGATCA ATATGCGGCA CATATGCAAC 120 
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CTATGTTCAA AAATGAAGAA TGTACAGATA CAAGATCCTA TACTGCCAGA ATACGAAGAA 18 0 

GAATACGTAG AAATTGAAAA AGAAGAACCA GGCGAAGAAA AGAATCTTGA AGACGTAAGC 24 0 

5 ACTGACGACA ACAATGAAAA GAAGAAGATA AGGTCGGTGA TTGTGAAAGA GACATAGAGG 300 

ATCCTTATCA CAAAGGAATC TTATCCCCCA CTACTTATCC TTTTATATTT TTCCGTGTCA 360 

TTTTTGCCCT TGAGTTTTCC TATATAAGGA ACCAAGTTCG GCATTTGTGA AAACAAGAAA 42 0 

10 

AAATTTGGTG TAAGCTATTT TCTTTGAAGT ACTGAGGATA CAAGTTCAGA GAAATTTGTA 480 

AGTTTGAATT C 4 91 

if": 

njl5 (2) INFORMATION FOR SEQ ID NO:14: 



(i) SEQUENCE CHARACTERISTICS: 
| (A) LENGTH: 408 base pairs 

(B) TYPE: nucleic acid 
i20 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

25 (iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
TCTAGACCAG AAGGTAATTA TCCAAGATGT AGCATCAAGA ATC CAATGTT TACGGGAAAA 6 0 

3 5 ACTATGGAAG TATTATGTGA GCTCAGCAAG AAGCAGATCA ATATGCGGCA CATATGCAAC 12 0 



CTATGTTCAA AAATGAAGAA TGTACAGATA CAAGATCCTA TACTGCCAGA ATACGAAGAA 18 0 



GAATACGTAG AAATTGAAAA AGAAGAACCA GGCGAAGAAA AGAATCTTGA AGACGTAAGC 240 
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ACTGACGACA ACAATGAAAA GAAGAGGATC CGTGTCATTT TTGCCCTTGA GTTTTCCTAT 300 

ATAAGGAACC AAGTTCGGCA TTTGTGAAAA CAAGAAAAAA TTTGGTGTAA GCTATTTTCT 36 0 

5 TTGAAGTACT GAGGATACAA GTTCAGAGAA ATTTGTAAGT TTGAATTC 4 08 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 418 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

jfL5 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
<iv) ANTI-SENSE: NO 

s 

Q20 

O (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

25 TCT AG AC C AG AAGGTAATTA TCCAAGATGT AGCATCAAGA ATCCAATGTT TACGGGAAAA 60 
ACTATGGAAG TATTATGTGA GCTCAGCAAG AAGCAGATCA ATATGCGGCA CATATGCAAC 12 0 

CTATGTTCAA AAATGAAGAA TGTACAGATA CAAGATCCTA TACTGCCAGA ATACGAAGAA 180 



30 



GAATACGTAG AAATTGAAAA AGAAGAACCA GGCGAAGAAA AGAATCTTGA AGACGTAAGC 24 0 



ACTGACGACA ACAATGAAAA GAAGAAGATA AGGTCGGATC CGTGTCATTT TTGCCCTTGA 3 00 

3 5 GTTTTCCTAT ATAAGGAACC AAGTTCGGCA TTTGTGAAAA CAAGAAAAAA TTTGGTGTAA 360 



GCTATTTTCT TTGAAGTACT GAGGATACAA GTTCAGAGAA ATTTGTAAGT TTGAATTC 418 



(2) INFORMATION FOR SEQ ID NO: 16 
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10 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 441 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

. (ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



i_Jl5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 

Gfi TCTAGACCAG AAGGTAATTA TCCAAGATGT AGCATCAAGA ATCCAATGTT TACGGGAAAA 60 

s™ ACTATGGAAG TATTATGTGA GCTCAGCAAG AAGCAGATCA ATATGCGGCA CATATGCAAC 12 0 

H 20 

= l CTATGTTCAA AAATGAAGAA TGTACAGATA CAAGATCCTA TACTGCCAGA ATACGAAGAA 180 

™ GAATACGTAG AAATTGAAAA AGAAGAACCA GGCGAAGAAA AGAATCTTGA AGACGTAAGC 240 

O 

2 5 ACTGACGACA ACAATGAAAA GAAGAAGATA AGGTCGGTGA TTGTGAAAGA GACATAGAGG 300 

ATCCGTGTCA TTTTTGCCCT TGAGTTTTCC TATATAAGGA ACCAAGTTCG GCATTTGTGA 360 
AAACAAGAAA AAATTTGGTG TAAGCTATTT TCTTTGAAGT ACTGAGGATA CAAGTTCAGA 42 0 

30 

GAAATTTGTA AGTTTGAATT C 441 
(2) INFORMATION FOR SEQ ID NO: 17: 

»^ 

3 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 476 base pairs - 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 
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<ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
TCTAGACCAG AAGGTAATTA TCCAAGATGT AGCATCAAGA ATCCAATGTT TACGGGAAAA 6 0 

ACTATGGAAG TATTATGTGA GCTCAGCAAG AAGCAGATCA ATATGCGGCA CATATGCAAC 12 0 

CTATGTTCAA AAATGAAGAA TGTACAGATA CAAGATCCTA TACTGCCAGA ATACGAAGAA 180 
GAATACGTAG AAATTGAAAA AGAAGAACCA GGCGAAGAAA AGAATCTTGA AGACGTAAGC 240 
ACTGACGACA ACAATGAAAA GAAGAAGATA AGGTCGGTGA TTGTGAAAGA GACATAGAGG 300 
ACACATGTAA GGTGGAAAAT GTAAGGGCGG AAAGGATCCG TGTCATTTTT GCCCTTGAGT 360 
TTTCCTATAT AAGGAACCAA GTTCGGCATT TGTGAAAACA AGAAAAAATT TGGTGTAAGC 42 0 

TATTTTCTTT GAAGTACTGA GGATACAAGT TCAGAGAAAT TTGTAAGTTT GAATTC 476 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
ACCGGTACCA GAAGGTAATT ATCCAAGATG T 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

5 CGGAATTCAA ACTTACAAAT TTCTCTGAAG 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 
0 (A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

5 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 



(iv) ANTI -SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
5 CGCGATCCAG ACTGAATGCC CACAGGCCGT CGAG 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
AGACGTAAGC ACTGACG 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

35 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 
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34 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 
CTTATCACAA AGGAATCTTA TC 22 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
CTTATCACAA AGGAATCTTA TC 22 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
GCTCTAGACC AGAAGGTAAT TATCCAAG 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
<iv) ANTI- SENSE : NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 
TATGGATCCT ATGTTCAAAA ATGAAG 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(iv) ANTI -SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
AAAGGATCCT GAAGACGTAA GCACTG 26 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
AGAGGATCCG GTCGGTGATT GTGAA 25 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
AAAGGATCCT TATCACAAAG GAATC 25 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
TATGGATCCG TGTCATTTTT GCCCTTG 27 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
{ D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 



(iv) ANTI -SENSE: YES 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: 

5 CGGAATTCAA ACTTACAAAT TTCTCTAAG 2 9 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

!Jil5 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

ii 

3 (iv) ANTI- SENSE: YES 

3 20 

3 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

25 TAAGGATCCT TTCCGCCCTT ACATT 25 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 
3 0 (A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

35 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: YES 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
CATGGATCCT CTATGTCTCT TTCAC 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
ACAGGATCCG AC CTTATCTT CT 
(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(iv) ANTI -SENSE: YES 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
ACCGGATCCT CTTCTTTTCA TTGTTC 26 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
TCAGGATCCT TTTCTTCGCC TGGT 24 
(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(iv) ANTI -SENSE: YES 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 
5 ATAGGATCCA TATGTGCCGC ATA 23 
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